Unsupervised identification of useful visual landmarks using multiple segmentations and top-down feedback
نویسندگان
چکیده
In this paper, we tackle the problem of unsupervised selection and posterior recognition of visual landmarks in images sequences acquired by an indoor mobile robot. This is a highly valuable perceptual capability for a wide variety of robotic applications, in particular autonomous navigation. Our method combines a bottom-up data driven approach with top-down feedback provided by high level semantic representations. The bottom-up approach is based on three main mechanisms: visual attention, area segmentation, and landmark characterization. As there is no segmentation method that works properly in every situation, we integrate multiple segmentation algorithms in order to increase the robustness of the approach. In terms of the top-down feedback, this is provided by two information sources: i) An estimation of the robot position that reduces the searching scope for potential matches with previously selected landmarks, ii) A set of weights that, according to the results of previous recognitions, controls the influence of each segmentation algorithm in the recognition of each landmark. We test our approach with encouraging results in three datasets corresponding to real-world scenarios.
منابع مشابه
Automatic Selection and Detection of Visual Landmarks Using Multiple Segmentations
Detection of visual landmarks is an important problem in the development of automated, vision-based agents working on unstructured environments. In this paper, we present an unsupervised approach to select and to detect landmarks in images coming from a video stream. Our approach integrates three main visual mechanisms: attention, area segmentation, and landmark characterization. In particular,...
متن کاملObject Recognition by Integrating Multiple Image Segmentations
The joint tasks of object recognition and object segmentation from a single image are complex in their requirement of not only correct classification, but also deciding exactly which pixels belong to the object. Exploring all possible pixel subsets is prohibitively expensive, leading to recent approaches which use unsupervised image segmentation to reduce the size of the configuration space. Im...
متن کاملTop-down Attention Supports Visual Loop Closing
In this paper, we present a method to improve the loop closing behaviour for visual SLAM. Landmarks consist of a combination of attention regions and Harris-Laplace corners. The attention regions are detected by a visual attention system which combines image-based, bottom-up and target-related, topdown information. The ability to perform target-directed search is used to search for expected lan...
متن کاملA Feedback Model of Perceptual Learning and Categorisation
Top-down, feedback, influences are known to have significant effects on visual information processing. Such influences are also likely to affect perceptual learning. This article employs a computational model of the cortical region interactions underlying visual perception to investigate possible influences of top-down information on learning. The results suggest that feedback could bias the wa...
متن کاملCommon Landmark Discovery in Urban Scenes
In this paper, we introduce a method for unsupervised discovery of landmark objects in urban scenes. Unlike existing methods that deal with pre-defined and typically popular landmarks whose models or training data are available on the internet (e.g. Nortre Dame de Paris, Leaning tower of Pisa, etc.), we develop a new unified framework, common landmark discovery (CLD), which discovers even a pri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Robotics and Autonomous Systems
دوره 56 شماره
صفحات -
تاریخ انتشار 2008